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Chapter 6 Implicit function theorem 





Chapter 5 has introduced us to the concept of manifolds of dimension m contained in R”. 
In the present chapter we are going to give the exact definition of such manifolds and also 
discuss the crucial theorem of the beginnings of this subject. The name of this theorem is the 
title of this chapter. 

We definitely want to maintain the following point of view. An m-dimensional manifold 
M C R" is an object which exists and has various geometric and calculus properties which 
are inherent to the manifold, and which should not depend on the particular mathematical 
formulation we use in describing the manifold. Since our goal is to do lots of calculus on M, 
we need to have formulas we can use in order to do this sort of work. In the very discussion 
of these methods we shall gain a clear and precise understanding of what a manifold actually 
is. 





We have already done this sort of work in the preceding chapter in the case of hypermani- 
folds. There we discussed the intrinsic gradient and the fact that the tangent space at a point 
of such a manifold has dimension n — 1 etc. We also discussed the version of the implicit 
function theorem that we needed for the discussion of hypermanifolds. We noticed at that 
time that we were really always working with only the local description of M, and that we 
didn’t particularly care whether we were able to describe M with formulas that held for the 
entire manifold. The same will be true in the present chapter. 


A. Implicit presentation 

We go back to the idea of implicitly defined manifolds introduced briefly on p. 5-26. That 
is, we are going to have n — m real-valued functions on R”, and the manifold will be (locally) 
the intersection of level sets of these functions. Let us denote 





k= n=: 


this integer is frequently called the codimension of M. We are assuming 1 < m < n—1, 
so that 1 < k <n-—J1. (Hypermanifolds correspond to k = 1.) Let us call the “constraint” 
functions g1,..., 9%. Then we are proposing that M be described locally as the set of all points 
x € R” satisfying the equations 














g(x) 0, 
ope). = 10. 


Of course, since we want to do calculus we assume at least that each g; € C'. But more is 
required. We must also have functions which are independent from one another. For instance, 
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it would be ridiculous to think that the two equations 


Qr? + 274227 =2, 





define a 1-dimensional manifold in R*! 

But more than the qualitative idea of independence is required. Even in the case n = 2 and 
k = 1 we have seen that a single C1? equation g(x 1,22) = 0 does not necessarily give us a true 
manifold; see p. 5-2. We have learned in general that significant treatment of hypermanifolds 
(the case k = 1) requires that the gradient of the defining function be nonzero on M. The 
proper extension of this to our more general situation is that the gradients Vg; be linearly 
independent on M. This is the quantitative statement of the independence of the defining 
conditions. 

Here are four examples with n = 3 and k = 2 which are in the spirit of p. 5-2, and are not 
1-dimensional manifolds. 
EXAMPLE 1. This is a rather artificial example, that comes from the example x? — y? = 0 
on p. 5-2. Namely, let 


n(z,y,z) =z+a%—-y’, 
g2(z, Y, 2) = 2. 


The set given as g; = go = 0 is just the curve x? — y? = 0 in the x — y plane. This is not a 
manifold. Though 
Vo = (327, —2y, 1) and V92 = (0, 0, 1) 


are never zero, they are linearly dependent at the point (0,0, 0). 
EXAMPLE 2. A similar example, but one which is more geometrically significant, is given 
by 

n(2,y,2) =zy-z, 

g2 (a, Y, z) = %. 
The set g; = go = 0 is just given as xy = 0 in the x — y plane, and this is the intersection of 
the two coordinate axes. This is not a manifold, the origin being the “bad” point. Again, 


Vo = = L) and V92 — (0, 0, 1) 


are never zero, but are linearly dependent at the origin. 
The nice surface g; = 0 is called a hyperbolic paraboloid. The nice surface gz = 0 is the 
x —y plane. So this result shows that two nice surfaces may intersect in a bad “curve.” 
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EXAMPLE 3. Here is a rather obvious example, but also it illustrates the point. Two 
spheres in R? may intersect in a single point. Thus the intersection is not a 1-dimensional 
manifold. 





Notice that it is geometrically clear that the two relevant gradients are linearly dependent at 
the bad point. 

EXAMPLE 4. Here is quite an elegant example. We form the intersection of a certain 
sphere and right circular cylinder. Here are a “top” and 3-D view of the defining equations: 


xe ayr4zrad 


Xx +y?-y = 0. 





Notice that the gradients are 
(2x, 2y,2z) and (2z,2y — 1,0), 


and these are linearly dependent at (0,1,0). So we expect “trouble” near (0,1,0). Indeed, if 
we look at this intersection from far on the positive y axis and project onto the x — z plane 
(like the tangent plane to the sphere and the cylinder), we see this equation: 


ge + (1-2? —2") -—V1—22-27 = 0; 
solving this for x, 


la? = Vlj=e = 2% 
(l—27)? = 1-2? -2’; 


= gag: 
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This picture shows clearly the nonmanifold nature of the intersection of these two surfaces. 


PROBLEM 6-1. Show that the length of the intersection curve just discussed is 
equal to 
1 = 32 
4 | 2), 
9 V1l—2? 


DEFINITION. Let 1 <m<n-—1 and let M C R” be a set that we want to present as 
an m-dimensional manifold. Let k = n—m. Suppose that in a neighborhood of any point 
xo € M there are C' functions gi,...,g, such that 

(1) near a,c1€ M = Q(x) = Oforl<i<k, 
and 

(2) Vo.(2o),---,; V9x(ao) are linearly independent. 
Then we say that M is implicitly presented. 

This condition of linear independence can be put in the form of a Jacobian matrix, as on 
p. 2-54. Namely, write the defining functions as a column vector 




















gi(x) 
gz)=] 3 |], 
gn (2) 
so that R” 4 R*. Then 
Ogi/Ox, ... O91/O%n 
Dg(x) = : 
Og /Ox, ... Ogn/OXn 


is the Jacobian matrix of g. It is a k x n matrix, and our condition is that at 7 = x9 its rows 
are linearly independent. 
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In the context of matrix algebra, the largest number of linearly independent rows of a 
matrix A is called the row rank of A. Likewise for column rank. A relatively simple matrix 
algebra theorem asserts that always row rank = column rank. This is proved in the next 
section. Thus we would say that an m x n matrix has maximal rank if its row rank = column 
rank = min(m, 7). 

In these terms our condition is that the row rank of Dg is k. As k < n, the condition can 
be rephrased by saying that 

Dg has maximal rank. 


Moreover, it is not hard to see that if the matrix Dg(xzp) has maximal rank, then so does 
Dg(«) for all x in some neighborhood of a9. This is such simple yet important linear algebra 
that we pause for a discussion of this material. 


B. Rank 


We work with matrices with real entries, although the identical things hold for matrices 
with complex entries, or rational entries, etc. 





DEFINITION. Let A be an m x n matrix and let the columns of A be ay,...,an, € R™: 
A= (04 tig. 22% Wy): 


The column rank of A is the dimension of the vector space spanned by the columns of A. This 
vector space is called the column space of A. The row space and row rank are defined similarly. 
We can now immediately prove the theorem alluded to above: 


THEOREM. Row rank = column rank. 


PROOF. Let ¢ denote the column rank of A. Then the column space has dimension @. 
Therefore there exist ¢ vectors b;,...,b¢ € R™” such that each a; is a linear combination of the 
form 





l 
i= S cede; 1S 5 St 
k=1 
(By the way, the b;,’s are linearly independent and thus the c,;’s are unique, but none of that 


matters for the proof!) Rewrite these vector equations as equations for their i** components: 


£ 


aig = S binces, 1 < 1 < m. 
k=1 
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(We recognize this as a matrix product 


A= BC) 


mxn mxkkxn 


Now we simply reinterpret these equations as representing the rows of A: 


e 
(Gi1, Qi2,.-+, Qin) = So bin(cer, Cininy en) 1<i<m. 
k=1 
These equations show that the row space of A is spanned by the @ vectors (cg1,..-, Cen). Thus 


the row rank of A is no greater than @. 
We have therefore proved that 


row rank of A < column rank of A. 


Now just note that replacing A with A‘ interchanges row rank and column rank. 
QED 


DEFINITION. § rank(A) = column rank of A = row rank of A. 


PROBLEM 6-2. Let A be a real m X n matrix. Prove that A has rank > @ => 
there exist rows numbered 7; < 72 < --- < zg and columns numbered j; < jo <--: < je 
such that the square matrix B with entries 


bag = Viagg 


has nonzero determinant. 

(HINT: use the row rank to find @ rows which are linearly independent. Discard the other 
rows to get an @ x n matrix A’ with row rank @. Now use the fact that A’ has column 
rank @.) 


PROBLEM 6-3. Prove that if the real m x n matrix A has rank > @, then all real 
m Xn matrices whose entries are sufficiently close to those of A also have rank > @. In 
particular, if A has maximal rank, then all nearby matrices also have maximal rank. 


PROBLEM 6-4. _ In the definition of Section A show that the second condition may 
be replaced with 
(2) Vagi(x),...,Vge(a) are linearly independent for all x in some neighborhood of Zo. 
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PROBLEM 6-5. What connection is there among the numbers rank(A), rank(B), 
and rank(AB)? 


PROBLEM 6-6. Let A be an m Xx n matrix. Prove that the following conditions are 
equivalent. 

a. rank(A) = 1. 

b. A= (a;b;) where not all a; are zero and not all b; are zero. 

c. A= BC where B is a nonzero m x 1 matrix and C is a nonzero 1 x n matrix. 














PROBLEM 6-7. Let a, b be nonzero vectors in R”, written as n x 1 matrices (column 
vectors). Prove that the eigenvectors of the rank 1 matrix ab‘ are the scalar multiples 
of a together with the orthogonal complement of b, and that the eigenvalues are ae b, 
oa gl): 


PROBLEM 6-8. — Continuing with Problem 6-7, prove that the characteristic poly- 
nomial of ab’ is (—1)"\""!(A — ae b). (Cf. Problem 3-41.) 


C. Implicit function theorem 


We have already discussed a version of this theorem in Section 5E. That version pertained 
to the case of hypermanifolds, in which k = 1 and there is just one function for which the 
equation g(x) = 0 needs to be “solved.” The present situation calls for a direct generalization. 
The hypothesis and conclusion are by now easy to guess, and the easy guess is correct. 





THEOREM. Suppose 1 < k < n—1, and suppose R" > R* is of class C! in a neighborhood 
of xo, and 














g(Xo) = 0. 
Suppose the Jacobian matrix Dg(xo) has maximal rank (equal to k). Then matrix algebra 
asserts that there are k columns of Dg(xq) which are linearly independent. For ease in writing, 


suppose these are the last k columns: columns m+ 1 through n. Write points in IR” in the 
symbolic fashion of R" = R™ x R* as 























eS "Bis easy Cia Bigs oss On) 
= (aa). 
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CONCLUSION: there are k uniquely determined functions 








RP. Leak 




















of class C! near the point x, such that for all x in some neighborhood of xo, 
92) = 0 St,6 = ole) jor 1<7kk. 


Moreover, the functions yp; are as differentiable as g: if g is of class C*, then all y; are of 
class. 


The conclusion of this theorem asserts that we have succeeded in “solving” the system of 
equations g(x) = 0 for k of the coordinates in terms of the other m coordinates. It’s again as 
though we have & equations in k “unknowns” (%m41,-..,;%m), which we are able to solve. 

As mentioned already on p. 5-16, we do not prove this theorem in this course. 

Its relevance is just as before: it is exactly the theorem we need to understand the manifold 
M determined by the implicit presentation 


g(x) 0, 


0. 


9x(X) 


The essential nature of this theorem is that it guarantees that the linear approximation 
information contained in the Jacobian matrix Dg(xo) is decisive in describing the nonlinear 
situation contained in the function g near x9, provided that the rank condition holds. The 
prototype is the basic calculus situation y = f(x) in which we want to find an inverse function 
x = f-'(y). If f is affine it’s elementary provided f’ 4 0. And if f’ # 0, then we are 
guaranteed a local solution x = f~'(y) exists, though we don’t have an explicit formula. 


D. Results 


We suppose that the manifold  C R” satisfies everything required in Section B. We 
are now in a position to reap the many wonderful benefits of the implicit function theorem 
(“manifold” benefits). 


TANGENT SPACE. We use the exact definition of Section 5F. Thus T’,,/ is the set of all 
7/(0), for C* curves lying in M with (0) = vo. The same sort of theorem as in Section 5F 
gives the implicit representation of T),,/. Namely, 





heT,,M => Vg(to)eh = 0 forall L<i<k. 
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Since the normal vectors Vg;(xo) are linearly independent, this shows that T),,.M is a vector 
space of dimension m, as we would expect and desire. 


INTRINSIC GRADIENT. If M & R is defined only on M, then as before we can prove 
that there is a unique vector Wf (xq) in the tangent space T,,,M such that for all C1 curves 
in M with 7(0) = 2o, 





lt) = VF(00) ©7/(0) 











The other view point is that if R” , R is defined in the ambient space, then Wf(zo) is the 
orthogonal projection of the ambient gradient V f(z) onto the tangent space. That is, there 
are unique scalars A1,..., Ax such that 


Vf (xo) = VF(vo) — AV gil) 


belongs to TM. 
Of course we continue to call Wf (9) the intrinsic gradient of f with respect to the manifold 
M. 


LAGRANGE MULTIPLIERS. An intrinsic critical point of f is a point x79 € M such that 
Vf (xo) = 0. In other words, there exist scalars A,,..., Ax, known as Lagrange multipliers, such 
that 


k 
i=1 
In practice this works in the usual way. There aren +k “unknowns”: 21,...,%n3A1,---,Ak3 


and there are n+ k equations they have to satisfy — the n equations above together with the 
k; constraints 


gi(2) 0, 


I 
2 
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PROBLEM 6-9. Find the intrinsic critical points of z+ + y* + z+ on the manifold 
gt+ytet=letytz=l. 


(Solution: First, we note that this is a 1-manifold: we can recognize it as a “small circle” on 
the unit sphere containing the points 7, 7, and k, or we can see that the gradients (2x, 2y, 2z) 
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and (1,1,1) are linearly independent (since no point (s, s,s) belongs to the set). We may 
therefore use Lagrange multipliers. After adjusting constants, we have 


x = \,0 4+ ro, 
y° = Ayy + Ao, 
a =\,z+9, 
e+yt+2? =1, 
ertytz = 1 


We already know x = y = z to be impossible. Thus one coordinate must differ from the other 
two. We treat the case z 4 x and z # y. Then eliminate A, to obtain 


Divide by x — z and y — z, respectively: 


g?+ez2+27 =), 
yi tyzt+2? =r. 
Subtract: 


g? —y? +22 — yz = 0; 
(x —y)(a@t+ytz) =0; 


xr—y=O0. 
Thus « = y. The constraints imply 
Ite =, 
20 => 2 =I 


Thus 


Qxn* + (1-22)? = 1; 
6a? — 4x = 0; 
$=0 or 2/2. 
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We thus find (0,0,1) and (2/3,2/3,—1/3). By symmetry, the function has six intrinsic 
critical points: 


(1,0,0), (1/3, 2/3, 2/3), 
(1,0); +(278—1/3)2/2), 
0:1), -(2/3,2/3,—1/2),. 


The three on the left are maxima (f = 1) and the three on the right are minima 


(f = 11/27).) 


PROBLEM 6-10. Using the Lagrange method, find the intrinsic critical points of 
the function f(z, y, z) =z on the “great circle” 27+ y?+27=1,2+y+2z=0. 


(Solution: We have 


1 = \yx4+ Ao, 
0 = Aiyt rs, 
0 = 12+ ra, 
e+y+2* =1, 
GARY Zz = 0. 


Thus subtraction gives 1 = \; (a — y) = A(x — z). Thus ; 4 0 and we conclude y = z. The 
constraints give x? + 2y? = 1, x = —2y. Thus 6y? = 1 and we obtain two critical points: 


ee eee 


v6 


PROBLEM 6-11. Find the extreme values of the distance between the origin (0, 0, 0) 
and points of the first octant portion of the curve 


fy 2 =4, 
LYZ =a. 
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PROBLEM 6-12. Consider the ellipse 
FE: 


and the straight line 


Find a necessary and sufficient condition on the semiaxes a and b such that EF and L have 
no points in common. Given this condition, find the minimum distance between points 
of EF and L. Do this by using Lagrange multipliers to find the minimum of the function 


D(a, £2, 41, ya) = (#1 - yi) + (a2 - yo)”, 


where x and y are subject to the appropriate constraints. 


PROBLEM 6-13. Continuing with the preceding problem, show that in addition 
to the minimum, there is another intrinsic critical point of the function D. What is the 
nature of this other critical point (minimum, local maximum, or saddle point)? 


PROBLEM 6-14. Here is a generalization of Problem 6-12. Let A be ann x n 
symmetric positive definite matrix and let u be a nonzero vector in R”. Consider the 
ellipsoid 

E={a¢ €R"| Arezr=1} 
and the hyperplane 

L={¢ €R"|uer=}}. 


a. Prove that EN L=@ <> Atueu <1. 
b. Find the minimum of ||x — y|| for x € E, y € L. 
(HINT: principal axis theorem.) 


PROBLEM 6-15. Let n > 3 and find the maximum value of 57", x? on the sphere 
lIcl| =1, Via ei = 1. 
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PROBLEM 6-16. (This problem is from Mathematical Intelligencer, vol 16, no. 4 (1994), p.8.) 
A circle is inscribed in a face of a cube. Another circle is circumscribed about a neighboring 
face of the cube. Find the least and greatest distances between the points of the circles. 


Please be kind to the grader standardizing the cube to have length 2 and be centered at the origin with sides parallel to the coordinate axes: 


{~ (x,y,1) with x2+y=1 
ee . 














~ 


(1,u,v) with u2+v2=2 





PROBLEM 6-1 yaar Here is a problem from American Mathematical Monthly, Volume 101, Number 6, June - 
July, 1994, proposed by Murray S. Klamkin, University of Alberta, Edmonton, Alberta, Canada: 
Determine the extreme values of 





fl 1 ‘: 1 i 1 
2, Y, 2 U,0,w) = 
i lte2tu ltytv lt+zt+w 


= 43 = 73 
where xyz = a”, uvw = 6b’, and gz, y, Zz, u, v, w > 0. 
Clearly, 0 < f < 3. The function f is defined on a 4-dimensional manifold M contained in R°. If you let x + 0, y ~ 0, u + co, v > co on 
this manifold, you see that f assumes values arbitrarily close to 0. Thus f does not attain a minimum on M; rather, we say 





inf f =0. 
inf f 


Also if you let x — 0, y 0,u—->0,v 0 on M, you see that f assumes values arbitrarily close to 2. Therefore, 





2<supf <3. 
M 


Now here is my version of Klamkin’s problem: 
1. Find all the intrinsic critical points of f on M. 
2. Calculate sup f. 
M 


3. Investigate whether or not f attains the value sup f at some point of M. 
M 
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PROBLEM 6-18. Find all the intrinsic critical points of the function f(x) = 7+ y 
on the manifold described intrinsically as x? + y? + z7 = 49 and xyz = 24. Also identify 
the local nature of each of these intrinsic critical points. 


Cartesian product of manifolds. The three preceding problems provide an illustration of 
a very general construction of manifolds. In Problem 6-14, for instance, we are really dealing 
with the Cartesian product 

















ExL={(z,y)|ceF,yeL}cR"xR". 








Thus E x L is a manifold of dimension 2n — 2 contained in R2”, and the problem asks that 











we consider the function R2” 4 R defined by 


f(x,y) = |lv — yl]. 





The general situation is quite easily understood. If M; C R™ is a manifold of dimension 
m, for i = 1, 2, then M, x Mz is a manifold of dimension m, + mz in R™*”". We saw an 
excellent example of these ideas in Section 5G where we introduced the flat torus. 





E. Parametric presentation 


We have seen many examples of this way of viewing a manifold. For instance, curves in 




















R” were actually defined this way on p. 2-3: as continuous functions R , R". The variable 
t in the expression x = f(t) is often called a parameter. In terms of manifolds, we are more 
interested in the image of f, 








M = {f(t) te R}, 


and we need to make the usual sort of assumption that f is of class C' and f’(t) 4 0. 
Furthermore, f needs to be one-to-one in order to have just one parameter value for each 
point of the curve. The fact that t € R is what gives the image M its dimension of 1. 

In the case of the tori of Section 5G, we have seen parametric presentations of each involving 
two parameters. The geometric idea is quite clear in general. We have a one-to-one function 

















R” 4 R” 








where the functional relation may be expressed 
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We are anticipating an image 





M = {F(t)|teR™} 
which is an m-dimensional manifold. The idea is that the function F’ maps the “flat” parameter 


space R™ in a “curvy” manner into R”. The straight coordinate lines of R™ themselves are 
transported to curves which form a kind of “curvilinear” grid on M: 






































aE 
at, 


R” 


dF 


We shall require an important condition to insure that this provides an actual smooth manifold. 
That is, we require that the vectors in R” given as 

OF OF 

bn a 
be linearly independent...lest the tangent lines in the curvilinear grid be tangent to one 
another. 














The reasoning we have just given seems quite clear. If we imagine t; as varying while all 
the other ¢;’s remain fixed, the function F(t) gives a curve from R into R”, and the curve lies 
in the manifold M. Thus its derivative OF'/Ot; is tangent to M. Therefore if xp = F'(to) is a 
fixed point of M, then 




















OF OF 


—(to),...,=— 2M. 
a, | 0); » Bp (to) = Tx5M 
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We demand T;,, VM be an m-dimensional space, and thus the assumption of linear independence 
of the vectors OF'/Ot; is quite natural. 





EXAMPLE. Spherical coordinates on the unit sphere in R*. We begin with the usual 
spherical coordinates for R*®, which we describe with a sketch: 





hy 


(x, y, 2) 











The formulas are 


x =rsinycos8, 
y =rsingsing, 
Zz =frcosy. 











Here we leave out the z-axis in R*, and r is the norm of the vector (2, y, z); thus 0 < r < oo. 
We stress that 0 < y < 7 only, but the usual polar coordinate @ is determined only up to 
additive integer multiples of 27. 





Warning. Many books denote the norm r by p. What is more troublesome is that many 
books, especially in physics, have the letters y and @ interchanged. A little care needs to be 
taken to make sure of the notation used in any material, and this is only a minor inconvenience. 














The unit sphere M C R? is of course described by the restriction r = 1. Therefore we 
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obtain spherical coordinates y, 6 for the sphere itself, the formulas being of course 


x =sinycosd, 
y =singsing, 
Z =cosy. 
Besides the ambiguity in 6, the restriction 0 < y < 7 leaves out the north pole (0,0, 1) and the 


south pole (0,0,—1). The corresponding tangent vectors are the y and @ partial derivatives, 
respectively. Writing them as column vectors we have 


OF cos y cos 6 OF —sinysin@ 
Bo 7 «| cos” sin 0 and or sin y cos 0 
a —siny 0 


Notice that these vectors satisfy the relations 


la [= * [ar|= 9% arom 


00 


Here’s a sketch: 





sin y cos 6 
Notice also that these vectors are indeed orthogonal to the normal vector | sinysin @ | to 
cosy 
the surface (this was guaranteed to occur). 

The spherical coordinate situation generalizes to R”. This is easily accomplished by induc- 
tion on n, the general step being just like the transition from R? to R*. Thus suppose x € R” 
is represented as x = (2’,x,), where x’ € R"1 and x, € R. Suppose that x’ 4 0 and that we 
already know how to represent x’ in spherical coordinates for R"~!. As usual, let 






























































r= (ell. 


Then 
Ln Ln 


p Val? + 27 
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and thus this quotient is in the open interval (—1,1). It is therefore equal to cosy,_2 for a 
unique 0 < ~,-2 < am. This is the “new” angle in the spherical coordinate system for R”. 
Then 





le’? = P-e = r?—9? cos* yao = 1’ sin? Yp-5, 
so we have 
la’ || = sing. > 0. 
We then have the relevant angles »,..., Qn—3, 9 and formulas for 2’, using its norm r sin ~y_2. 











PROBLEM 6-19. Show that points in R* can be represented as 





“1, =rsingyesin y, cos§, 
L2 =rsingesinyg, sing, 
23 =1rsiny2cos ¢1, 

L4 =P7CoOsa, 


where 0 < gy, < 7,0 < Yo <7,0<r < ow, and @ is determined up to integer multiples 
of 27. Show that the condition for x € R* to be so represented is simply x? + 73 £ 0. 
Show also that if F'(r, Y1, Y2, 0) represents the above formulas, then the first order partial 
derivatives are mutually orthogonal, and 














OF 
a Mi ee Ah 
Or , 
| OF 
— = tT; 
Ov 
OF : 
—— || = rsin 
dvi 2, 
OF || 
30 = rsing2sin Y}. 
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PROBLEM 6-20. Consider a curve R 4 R? of class C'!, represented in spheri- 
cal coordinates by naming the spherical coordinates of f(t) as r(t), y(t), O(t) (abuse of 
notation). Prove that 





If Oll = yr? +19? + r2sin’y 6, 


This relation is often expressed symbolically as 


(ds)? = (dr)? + r?(dy)? +r? sin%y (d0)?. 





PROBLEM 6-21. What is the corresponding result for a polar coordinate represen- 
tation of a curve in R?? 
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PROBLEM 6-22. There is a particularly interesting class of curves on a sphere in 
R°. Assume you are dealing with the unit sphere ||z|| = 1, so that the two angles y, 6 
serve as coordinates. Then of course 





(ds)? = (dy)? + sin*p (dé)?. 


Let 0 < a < 5 be fixed and consider a curve on the sphere “going north” at a constant 
angle a with the meridian. That is, if the curve is given as f(t), 





ca) e (— cos y cos 6, — cos ysin 8, sin y) = cosa. 
If’) 
Such a curve is called a lorodrome. 
a. Show that 
y= cosa) y? + sin’ 6”. 
b. Show that 


y =+cotasing 6. 


c. Assume that y’ < 0 (going north) and &’ > 0 and solve the differential equation in 
(b) to express 6 as a function of y. 
d. Show that the loxodrome exists for —oo < 6 < oo and that its total length is 7 seca. 



























































As we mentioned just before the example, the parametric representation of M gives a 
very nice understanding of the tangent space T,,,M. For if F(to) = wo, then the vectors 
OF /0t;(to) are in TM, an m-dimensional space. Thus the linear independence of these m 
vectors guarantees the situation that every vector in T’,,M is a unique linear combination of 


OF /Aty,...,OF/Otm. 


In Section C we discussed the implicit presentation of a manifold and the crucial role 


Implicit function theorem 21 


played by the Jacobian matrix of the defining function. The parametric representation we are 
now considering has a corresponding Jacobian matrix. Namely, in properly understanding the 














parameterizing function R™ 4, R” we introduce the Jacobian matrix of F: 





ari oF 
oti aioe Otm 
DF =| : : 
OF n OF n 
Oty ais Otm 


It is the m columns of this matrix that are required to be linearly independent. Thus DF has 
maximal rank. 

In the notation of Section A, if M is also represented implicitly by R” > R* as the set 
of points satisfying g(x) = 0, we have 











R” 4 RR 


yor = 0, 











’ 


Thus the chain rule says exactly that 


kxn nxm 


The implicit function theorem is once again important in the present context. It shows 
that a parametric presentation of M generates (in theory) an explicit presentation. We still 











work with the above notation R™ 4, IR”. 
We then introduce the new function 


R™ x R" , R” 























defined by 


Pia) = FO) =s. 
Note that F(to,79) = 0. Also note that the Jacobian matrix DF has the m+n columns, 


ar ar. 
Gti Oba 


The first m of these are linearly independent, by hypothesis. A standard linear algebra 
technique allows us to start with the vectors —é,,...,—€é,, and replace some of them one at 


22 Chapter 6 





a time by all of OF /0t,,...,0F /Otm, and still have a basis for R”. For ease in writing, let us 


say that —é,,...,—€m have been replaced. Then the columns 
OF OF : . 
ey a Emit, ++, —En 
Ot, Otin — 





are a basis for R". Thus the implicit function theorem of Section C allows us to assert that 
the equation 
Fij=% = 0 


can locally be “solved” for t1,...,tm, 2m41,---,£n as functions of the other variables 71, ..., Zn. 
In particular, we obtain near x 


gEM => G44 = UG oca, Sm) for 1<i<k. 


(The theorem also guarantees that the parameters t,,...,t, are also functions of x1,...,2m, 
but we don’t need that extra information here.) Thus the manifold is again represented 
explicitly. 

Here is how the above analysis works in the case of a parametrized surface M C R?. The 
points of M might be represented as depending on parameters t, and ta: 





C= Pitta) 
y = glti,te), (*) 
Zz = h(ty, te). 


We would assume that the corresponding Jacobian matrix have rank 2. For instance, suppose 
that 
Of of 
det Se Se #- U. 
Ot, Ote 


Then we could in principle locally solve the first two equations in (*) to produce for points in 
M 


’ 


And then we have in turn 
z=h(al(z,y), b(x,y)), 


so that M is locally represented explicitly. 


Implicit function theorem 23 


F. The explicit bridge 


This section consists primarily of observations about what we have already achieved for an 
m-dimensional manifold M Cc R". We have thoroughly discussed both an implicit description 
and a parametric description of M. A local explicit description presents M by giving n —m of 
the coordinates of IR” as “explicit” functions of the other m coordinates. For instance, locally 
we might have the presentation 








CCM > tag = Oil Sig e.cs Sm) for 1<i<k=n-—m. 


This presentation is clearly both implicit and parametric, simultaneously. It is interesting 
that to go from a general implicit or parametric presentation of MM to the opposite kind, we 
used the implicit function theorem in each case to find an explicit presentation. Thus all three 
types of presentation of M are equivalent, at least in theory. 

The phrase “in theory” is of course quite important, as we may not be able to solve the 
equations in closed form, or we may not care to do so. The famous folium of Descartes is a 
case in point. This is a one-dimensional manifold (except for one bad point) in R?. Its natural 
and famous implicit presentation is this: 





o+y> = 3ay. 


Notice the gradient of the defining function is (32? — 3y,3y? — 3x), and this is nonzero on M 
except at the origin. Here’s a plot of the folium: 


y 


Notice the singular point (0,0). 


In this example a “trick” produces a parameterization. Namely, consider the intersection 
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of the curve with the line y = tz of slope t. Then 
ge +e? = 3atex. 


Discounting the origin, we can divide by x7: x + t3x = 3t. Thus 


3t 
a TB’ 
_ 38 
oS Tae 
This formula makes sense for all t 4 —1 and produces all points on the folium of Descartes. 
It is easy to check that the velocity vector (4, dy) is never 0. Thus for this manifold we 


are able to go from implicit to parametric presentation without stopping to find an explicit 
representation at all. This is quite fortunate, as solving x? + y? = 3ry for y as a function of 
x, or solving x = os for t as a function of x are both tremendously difficult tasks, involving 
the solution of cubic equations. 


PROBLEM 6-23. The sketch shows that the folium of Descartes has a vertical 
tangent at a point in the first quadrant. 
a. Find this point by computing and using dx /dt. 
b. Find this point by using Lagrange multipliers to find an intrinsic critical point of x 
on the folium. 


PROBLEM 6-24. Consider the “figure 8 curve” of Problem 2-6, and its polar 
representation r? = cos 26. Show that the length of this curve equals 





m/2 dt 
2 ; 
/ Vcost 


PROBLEM 6-25. Consider the “cardioid” given in polar coordinates as r = 1+cos 6. 
Sketch this curve in the x — y plane and show that its length equals 8. 


G. The derivative reconsidered 


We can now give a brief discussion of a vast but easy generalization of the concept of 
derivative. Suppose that we have two manifolds N and M and a function f from N to M: 
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N is n-dimensional, N Cc R%; 





M is m-dimensional, M C R?; 


NM. 


’ 
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As we want only to introduce the concepts, we assume that f is differentiable, without actually 


giving the precise definition. 


Then consider a point x9 € N and the image yo = f(xo). We have the two tangent spaces 
at our disposal, T,,,.N and T,,M. If we think of these as tangent vectors of curves, then we 


can see how f can be used to map T;,,N into T,,M. 











ad 








R? such that 





Namely, suppose h € T,,, N. Then there is some curve R 


y(t) € N for all t, 
7(0) = Xo, 
7'(0) = h. 


But then the composition f oy is a curve into M such that 
fo 7(0) = yp. 


Thus its tangent vector 
(f o7)'(0) 


is a vector in T,,M. 
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DEFINITION. In the above situation, Df (xo) is the mapping from T),,N to T,,M given 
by the formula 


Df (x0) (9'(0)) = (f 97)"(0). 


That’s the idea. Of course, we have not made this at all rigorous. We would have to 
define the concept of differentiability of f and also would have to show that the definition of 
Df (xo)(h) is independent of the choice of curve 7 with y/(0) = h. All of that can be done 
quite readily, and it also follows that Df (xo) is a linear mapping from T,,.N to T,,M. 

Thus in a very precise sense D f(a) represents an affine approximation to the original 
mapping f. 

By the way, a more standard notation is f,(%9) = Df(xo), and f,(xo) is said to “push 
forward” tangent vectors. 


The next problem is an easy exercise illustrating this concept. 


PROBLEM 6-26. Consider a cartographer’s attempt to map a portion of the unit 
sphere S(0,1) C R? onto a flat piece of paper R?: 




















S(0,1) 5 R®. 
Use standard spherical coordinates on S(0,1) to write this in the form 
f (sin ycos 9, sin y sin 0, cosy) = (u(y, 8), u(y, 8)). 
Prove that the linear map Df (2o, yo, 20) maps the orthonormal basis vectors as follows: 
(cos Yo Cos Oo, Cos Yo Sin A, —sin Yo) to (Uy, Vy) 


and 
(— sin ,cos 6,0) to csc yo(us, v9). 
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PROBLEM 6-27. An interesting cartographic mapping is the gnomonic projection. 
Here is one way to view it as a function from R? to S(0, 1): 


(x,1) 











re (z,1) _  (@1) 
(za? +41 


a. Show that the inverse of this mapping is given by 


b. Show that the corresponding functions in the preceding problem are 





u(y, 0) = tan ycos8, 
u(y, 0) = tany sing. 





c. Prove that every great circle on the sphere gets mapped to a straight line in R?. 


(You can therefore imagine such a map might be useful in flying along great circle routes. ) 











H. Conformal mapping 


We continue to consider a mapping N !, M of one manifold into another, but now we 
assume that the two manifolds have the same dimension. One thing that could be asked is 
the question of what effect the mapping f has on angles. In particular, whether f preserves 
angles. 

This really comes down to a question about the derivative Df(xo), which is a linear 
mapping from the tangent space T,,, N to Tp(2,)M. We shall assume that Df (xo) is invertible, 
so that h 4 0 => Df(xo)h £0. Then the fact is that f preserves angles at xo if and only if 


Df(xo)he Df(xo)h' heh’ 


|DF(xo)hl] IDF (wo)h'| (Al A’ 
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for all nonzero vectors h, h’ € T,,,.N. This is of course due to the fact that the cosines of the 
relevant angles are measured by the respective expressions. 

To say the least, the verification of such an equation appears unwieldy. However some 
elementary linear algebra comes to our rescue. To see the relevance of the following result, 
notice that the linear mapping Df(xo) maps the vector space T,,N into the vector space 
T r(x9)M and these two vector spaces have the same dimension. We can therefore choose 
orthonormal basis for them and can thus represent Df (xo) by a square matrix. The following 
theorem about n x n matrices is therefore just what we need. 


THEOREM. Let A be a real invertible n x n matrix. Then the following conditions are 
equivalent. 


(1) A preserves angles. 
(2) A preserves right angles. 


(3) A is a positive scalar multiple of an orthogonal matric. 











(4) A maps some orthonormal basis of R" to a nonzero scalar multiple of an orthnormal 
basis. 





PROOF. Of course, (1) is exactly the sort of condition we are interested in. In symbols, it 
says that for any nonzero h and h’ € R", 


Ahe Ah’ heh! 
JARI AR [ell Neel 


Our strategy is to prove that (1) = > (2) => (3) = (4) = (1). All of these are 
straightforward except for (2) => (3). 

(1) => (2) is clear, as (2) means that heh’ = 0 = > Ahe Ah’ =0. 

(2) ==> (3) requires some calculation. If h, h’ are arbitrary nonzero vectors, then h is 
orthogonal to 














, heh 
I[P||? 
(remember p. 1-147), so (2) gives 


ae (an = mae") 2G, 


Thus bleh 
Ahe Ah! = Tae aa’. 
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Interchange h and h’ 
Ane Ah = 8 Wann. 
I[P’|| 
We conclude that VARI? |ARIP 
h'eh ( _ ) = 0. 
|||? I|r’||? 


JAh|| _ NAR 
al 


Therefore if h’eh 40 
Il 
h = 0, then we can use the equation we have just proved for the vectors h and 


h +h’ and then also for the vectors h’ and h + h’, and conclude that it still holds. Thus, for 





An‘, 
ie] 


In case h’ 


all nonzero h and h’ € R", 
AAI 
I|-| 








R”. 





Call this common ratio c. As A is invertible, c > 0. Thus 
lc AA|| = |||] for all he 
According to Problem 4-20f, c~'A = ® € O(n). Thus A = c® and (38) is verified 
(3) ==> (4) is clear, for if {@;} is an orthonormal basis, then so is {®¢;} for any ® € O(n) 
Let {¢;} be the particular orthonormal basis such that {cA¢;} is also an 
Let ® and W be the corresponding orthogonal matrices having those 


(4) => (1). 
orthonormal basis 
columns: 
® = (1 G2 .-- Gn); 
UV = c(Agi Ads tae AQy,)« 
Then we have the matrix equation 
UW =cA®, 
so that 
A=c'We@}, 
Thus A is a scalar multiple of an orthogonal matrix and we conclude that A preserves angles 
QED 
Now we return to our discussion of N 4 M. Our question about the preservation of angles 


by f now is answered by the condition that at each xo the derivative Df (zo) is a positive scalar 
That is, according to Problem 4—20f, we need 
for all h € T),,.N. 


multiple of an “orthogonal matrix 
|| DF (a0) hl] = (wo) lA 
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The local magnification factor c(xo) is of course allowed to vary with xo. Here is the summary 


DEFINITION. Suppose NV Misa map from an n-dimensional manifold N to an 
n-dimensional manifold M. Then f is a conformal mapping if it preserves angles. That is, if 
for every x € N there corresponds a local magnification factor c(x) > 0 such that 


||Df(x)Al| = c(x)||h|| for all h € TLN. 











For instance, a linear mapping from R” to R” has the form x — Az for a certain n x n 
matrix A. This mapping is conformal if and only if A is a positive scalar multiple of an 
orthogonal matrix, and the magnification factor is that scalar. 

In the general case the conformal function f will introduce some “distortion” in the way it 
moves points of N to M. Though f distorts lengths, it leaves all angles exactly unchanged. If 
it so happens that the local magnification factor c(x) is constant, then the mapping introduces 
no distortion at all. In particular if c(x) = 1 for all x, we would even say that f is an isometry. 

For example, the right circular cylinder x? + y? = a? in R? is locally isometric to R?. This 
can of course be readily seen from a polar coordinate representation of the cylinder by defining 

















f(acos6,asin6,z) =(a0,z), 0<@<2m7, -wo<z<o@. 


Then 


Df (acos@, asin @, z)(—asin 0,acos 0,0) = (a, 0), 
Df (acos 6, asin 6, z)(0,0,1) = (0,1). 
Thus Df maps the orthonormal basis (— sin @,cos 6,0), (0,0,1) (of the tangent space to the 


cylinder) to the orthonormal basis 7, 7 (of R?). The picture to keep in mind is that of cutting 
a paper cylinder along a straight line parallel to its axis and unrolling it to make it flat. 
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This mapping f is thus an isometry. 


PROBLEM 6-28. Any right cylinder, circular or not, can be isometrically flattened. 
Show this by using a curve 7 in R? which has unit speed and defining the cylinder to be 
the set of points of the form 














C=). 2) |e <t< b, oo < 2 <0}. 











The choice of C i, R? should now be clear. 
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PROBLEM 6-29. A right circular cone can also be flattened isometrically (locally). 
Show this by using a cone of the form shown here: 


(side view) 





z= cotar/z? + y? 
Use cylindrical coordinates for R? so that the cone has the parametric representation 


= ztanacos86, 
= ztanasin8, 


a = Bs 


Then “cut” the cone along 6 = 0 and flatten it out into the form 


with the mapping from the cone to R?: 


f(ztanacos 6, ztanasin@, z) = zseca(cos(@ sin a), sin(@ sin a)), 


Prove that f is an isometry. 


PROBLEM 6-30. Consider the two points (3,4,5) and (4,3,5) on the cone z = 
\/x2 + y?. What is the smallest possible length of curves joining these two points and 
lying entirely on the cone? (Use a calculator.) 

[Answer: 1.41659... ] Notice that it’s greater than 2. Comment? 
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Notice very particularly that this flattening of the cone preserves all angles. We are ex- 
cluding the vertex, and, indeed, angles seem to be distorted there. 


I. Examples of conformal mappings 


We now present important classical instances of conformal mappings, all of which are 
extremely important in many contexts. 


1. Holomorphic functions 
Surely the most basic possible situation is that of a function R? /, R?. Both manifolds are 
the Euclidean plane. Let us denote the Cartesian representation of f as 











f(x,y) = (u(z,y), v(x,y)). 
Then the derivative D f(x,y) maps the standard orthonormal basis as follows: 


Df(x,y)i = (Us, Ue), 
Df(@,y)J = (Uy, vy). 


Thus the condition that f be conformal is that the vectors (uz, v,) and (uy, vy) have the same 
(nonzero) norm and be orthogonal: 


2 Qo. ges 2 2 
u,+u, = u, tu, £0, 


Ugly + Ugdy = 0. 


If for example u, 4 0, then we have uy = —v,v,/uz, so if we write t = v,/u, then we have 
tu, = Vy, 
Uy = —tvz. 
And then 
uztour = Put Pur 


= (ue +2). 
Thus 1 = ?? and we have two cases. If t = 1, then the condition is 


du _ dv 
Ox ——Oy? 
Ou __ dv 


Oy Ox" 
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These equations are known as the Cauchy — Riemann equations. They are surely the 
most important partial differential equations in all of mathematics. A solution of them is said 
to be a holomorphic function; rather, the corresponding complex-valued function 


f(a + ty) = u(x, y) + iv(z, y) 


is said to be holomorphic (or analytic). As we have seen, in case u2 + us # 0, f is also 
conformal. 

The other case t = —1 is entirely similar, the corresponding equations being uz = —vy, 
Lig. =e 

A good example is the function which represents the square of a complex number. As 
(x + iy)? = x? + 2iry — y’, we define 


Fla.) = (a? ~~ yy 20y): 


This is seen to be holomorphic, with local magnification factor 2,/x? + y?. This mapping is 
thus conformal except at the origin. Indeed, angles are doubled at the origin. 

The study of holomorphic functions has been going on for centuries. In fact all universities 
devote at least one entire semester to this subject, in courses titled “complex analysis,” or 
“complex variables,” or “analytic functions,” etc. At Rice these come as either MATH 382 or 
MATH 427. Both are highly recommended. 


2. Inversion in a sphere 


We have already looked at this in Problems 2-79, 80, 81. The function R” — R” is given 
by 




















x 


I(@) = Ty: 


Its derivative is 
Df (a) = |la||-? — 2||a||-*(xex,), 


and we have the result 
Df ()Al| = |la|-7 AI. 


Thus f is conformal with the local magnification factor ||a||~ 

















PROBLEM 6-31. Asa possible generalization, let R” — R” be given in the form 


f(x) = gel). 


Compute Df(x) and show that f is conformal ==> g’(t) = 0 or g/(t) + 2g(t)/t = 0. 
Conclude that g is constant or g(t) = ct~?. 
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PROBLEM 6-32. Wecan perform inversion in any sphere. If the sphere is centered 
at 2 and has radius R, the correct formula is 


XL — Xo 


f(x) =X%o+ ie 


IIx — £o||?" 





Prove that the mapping is conformal. What is the local magnification factor? 
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PROBLEM 6-33. Reflection across a “plane” in R” is a somewhat simpler sort 
of mapping than inversion in a sphere. Suppose the “plane” (really, hyperplane) to be 
described as follows: let @ be a unit vector in R” and c € R, and define 






































M ={xER"|rzet=c}. 





Schematic figure: 


ab @ 





d 


f@) 











Then we map any x € R” to its “mirror image” f(x) with respect to M. That is, we 
determine t € R such that 

















f(a) =2+ta 
Mose EM. 


a. Prove that f(%) =x +2(c— re t)iu. 


b. Prove that 
Df (a;h) =h—2he tit. 


c. Prove that f is conformal. 


3. Mercator projection 


This is a cartographic exercise coming from the desire to map a portion of the unit sphere 
S(0,1) C R® in a certain way onto a sheet of paper R?, so that the meridians 6 = constant get 
mapped into equally spaced parallel lines and the mapping is conformal. Thus, using standard 
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spherical coordinates for the sphere, we are led to S(0, 1) /, R? of the form 





f (sin y cos 9, sin ysin 0, cos y) = (6, g(y)), 


where g is to be determined. Then we compute that the linear mapping Df has to transform 
the corresponding unit tangent vectors as follows: 


(cos y cos 6, cos y sin 8, — sin y) > (0, 9’(v)), 
(— sin 8, cos 0,0) += esc y (1,0). 


The resulting two vectors are orthogonal, so the condition for conformality is that they have 
the same norm: 


lg'(y)| = esc y. 


PROBLEM 6-34. Assume that g increases as the north pole is approached (y 
decreases to 0) and that the equator ~ = 5 gets mapped to the @-axis. Then show that 


g(v) = log(csc p + cot yp) = log cot 3 


This formula gives the famous Mercator projection: 


f (sin y cos 6, sin y sin 6, cos y) = (0, log cot *). 


PROBLEM 6-35. Show that the local magnification factor of the Mercator projection 
is CSC yp. 


PROBLEM 6-36. — Show that the loxodromes (Problem 6-22) on the sphere, which 
are the curves that maintain a constant angle with the meridians, correspond to straight 
lines under the Mercator projection. 


This problem illustrates one navigational use of a Mercator projection. If you want to sail 
from A to B on S(0,1), use your Mercator map to locate A and B, connect them with a 
straight line, then sail at that constant bearing. The resulting journey will of course be longer 
than the great circle path. 
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PROBLEM 6-37. The coordinates of Houston, Texas, are 29.97°N latitude, 95.35°W 
longitude. The coordinates of Samara, Russia, are 53.23°N and 50.17°E. Assume the 
earth is a ball with radius 3960 miles. 

a. Find the great circle distance between these cities. 

b. Find the “loxodromic distance” between them. 


Blondie 


HEY, L ONLY SAID HE WAS 
A MAPMAKER, | R SAI 





uy 


AI 
Mn 


A 
Wet 
22'S 


ere 


PROBLEM 6-38. — Consider a “loxodromic” “triangle” on the unit sphere. Explain 
what this should mean, and prove that the sum of the interior angles of such a “triangle” 
equals 7. 
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4. Stereographic projection 
In this cartographic exercise we again map 


S(0,1) 5 R?, 





but now the desire is that the meridians 6 = constant near the north pole themselves get 
mapped to the straight lines 6 = constant in a standard polar coordinate representation of 
R?. Thus we have for an as yet unknown polar coordinate r(y), 





f (sin y cos 9, sin y sin 0, cos y) = (r(y) cos @,r(y) sin 8). 


As we want the north pole to map to the origin, we assume r(0) = 0. 


PROBLEM 6-39. Show that this mapping f is conformal => 


" 
——' =P CseY. 


dip 


Conclude that : 


te) csc p + cot yp 


for some positive constant c. 


PROBLEM 6-40. Take c = 1 in the preceding problem. Let points in S$(0,1) be 
represented in Cartesian coordinates as y = (yi, Y2, y3), and let 


f(y) = (@1, £2). 


Show that 


Y1 
1+ ys’ 
Y2 
i+ ys. 





Ly = 





r2 = 





The function f of this problem is called stereographic projection of the unit sphere onto 
R?. 
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PROBLEM 6-41. Stereographic projection provides a nice result concerning loxo- 
dromes. Consider a loxodrome as described in Problem 6-22, described by the relation in 
spherical coordinates, 

6 cot a = log(csc y + cot y). 











Using stereographic projection, this loxodrome becomes a curve in R?. Show that the 
equation of this projected curve in the usual polar coordinates for R? is 

















r= e Pct a 


This plane is called a logarithmic spiral. 


REMARK. Martin Gardner’s delightful book, aha! Insight, Scientific American 1978, has 
an interesting discussion in the Section “Payoff at the Poles.” Here’s a quotation from p. 43: 
“Here is a different navigational problem that involves a fascinating curve on the sphere known 
as a loxodrome or rhumb-line. ... A loxodrome, plotted on a flat map, has different forms 
depending on the type of map projection. On the familiar world map called the Mercator 
projection, it is plotted as a straight line. Indeed, this is why a Mercator map is so useful to 
navigators. ... When a loxodrome is projected on a plane parallel to the equator and tangent 
to a pole, it is an equiangular or logarithmic spiral.” 


Notice that Martin Gardner’s discussion should have been more precise at the end, by 
stating that the projection used is stereographic. 
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PROBLEM 6-42. Here is asimple geometric way of defining stereographic projection. 
Work with the unit sphere 


S(0,1) CR" 


and denote its points with Cartesian coordinates: 


y= (yi, see ‘tha thisea) — (y’, Yn+1); 


where y’ € R”. We then set up a function from all of S(0,1) except for the south pole 
—é€nii = (0,—-1) to all of R” by means of starting with y, constructing the straight 
line determined by y and —é,,,;, and finding its intersection with the “equatorial plane” 
R” x {0}: 


S(0,1) 





a. Prove that 


(This agrees with what we found earlier in case n = 2. 
b. Prove that the inverse is 


y= Pe) =( ft ): 


1+ fal? 1+ |e)? 





c. Prove that f is conformal with local magnification factor (1+ Yn41)~!. (In computing 
|| D f(y)h|| remember that yeh =y! eh! + Yniihns = 0.) 
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PROBLEM 6-43. Here is yet another way to analyze stereographic projection. 
Consider the inversion mapping of R"*+! with respect to the sphere S(—én41, V2) of radius 
/2 centered at —En41: 
a ¥y “- aa | n+l 
f(y) = —ényi + 2————.,, yE R"™. 
: ee earl” 





We know that f is a conformal mapping from R”t! to R"t!. Prove that when restricted 
to the unit sphere S(0,1) f equals the stereographic projection of Problem 6-42. 


PROBLEM 6-44. Sometimes it is convenient to alter the stereographic projection by 
projecting onto a hyerplane other than the one we have been using. Continue to project 
from the south pole —é,41, but now project onto any hyerplane orthogonal to the vector 
—én41. Denote the new hyerplane as M = {(z,a) | x € R"}, where a 4 —1. Let f, denote 
the corresponding stereographic projection onto R”. 
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Prove that f, is conformal by showing that f, = (a+1) fo. (The two most common choices 
for a are 0 and 1.) 





For obvious reasons there has been an enormous amount of work dedicated to conformal 
mappings of portions of $(0,1) C R? into the Euclidean plane R?. You should be aware that 
no such map can exist with no distortion. 
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PROBLEM 6-45. Prove that there is no isometry of any small region of $(0, 1) 
into R?. For instance, no matter how small € > 0 is, there is no isometry of {(z, y, z) | 
ety +22=1,1-e< 2 <1} onto a region in R’. 





(HINT: you might want to think about small circles in S(0,1).) 
5. Lambert conformal conical projection 


PROBLEM 6—46*. 


a. Find a map of the USA which displays the Lambert type of projection used. 
b. Write an essay about this projection, complete with formulas and proofs. 





